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Abstract 

Dynamic spectrum sharing is a promising technology to improve spectrum utihzation in the 
future wireless networks. The flexible spectrum management provides new opportunities for li- 
censed primary user and unlicensed secondary users to reallocate the spectrum resource efficiently. 
In this paper, we present an oligopoly pricing framework for dynamic spectrum allocation in which 
the primary users sell excessive spectrum to the secondary users for monetary return. We present 
two approaches, the strict constraints (type-I) and the QoS penalty (type-II), to model the real- 
istic situation that the primary users have limited capacities. In the oligopoly model with strict 
constraints, we propose a low-complexity searching method to obtain the Nash Equilibrium and 
prove its uniqueness. When reduced to a duopoly game, we analytically show the interesting gaps 
in the leader-follower pricing strategy. In the QoS penalty based oligopoly model, a novel variable 
transformation method is developed to derive the unique Nash Equilibrium. When the market 
information is limited, we provide three myopically optimal algorithms "StrictBEST" , "StrictBR" 
and "QoSBEST" that enable price adjustment for duopoly primary users based on the Best Re- 
sponse Function (BRF) and the bounded rationality (BR) principles. Numerical results validate 
the effectiveness of our analysis and demonstrate the fast convergence of "StrictBEST" as well 
as "QoSBEST" to the Nash Equilibrium. For the "StrictBR" algorithm, we reveal the chaotic 
behaviors of dynamic price adaptation in response to the learning rates. 

Key words: Dynamic Spectrum Sharing, Oligopoly Pricing, Cognitive Radio, Nash Equilibrium, 
Best Response Function, Bounded Rationality, Bifurcation and Chaos. 



1. Introduction 

Wireless spectrum has become the scarcest resource due to the dramatic development of mobile 
telecommunication industry in the last decades. However, recent studies by the Federal Commu- 
nications Commission (FCC) show that traditional fixed allocation policy results in very low effi- 
ciency in radio spectrum utilization. The increasing spectrum demand, together with the resource 
scarcity, gives rise to the development of cognitive radio networks that enable dynamic spectrum 
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access. Within a dynamic spectrum access system, radio spectrum resources are allocated by agile 
management schemes in terms of spectrum market among the unlicensed (i.e., secondary) users and 
the licensed (i.e., primary) users [3|]. When the possessed spectrum is not fully utilized, a primary 
user has an opportunity to sell the excessive spectrum to the secondary users for monetary payoff. 
This is also referred to as spectrum trading mechanism in 0] which involves spectrum selling and 
purchasing processes. Therefore, it is natural to consider the spectrum allocation in the perspective 
of economic models and market strategies. 

In such an emerging network scenario, multiple primary users coexist in the same geographical 
site and compete for the access or the purchase of secondary users equipped with cognitive radios. 
Hence, an important problem for the spectrum trading is how the primary users set prices of per- 
unit spectrum in a competitive market. For example, if a primary user sets a very low price, it 
might result in the loss of revenue (or profit). On the contrary, if the price is set too expensive, 
the secondary users are inclined to purchase from other spectrum holders. Niyato et.al j^] initially 
introduce the oligopoly pricing theory to characterize the interactions between the spectrum abun- 
dant side (primary users) and the demanding side (secondary users). In the oligopoly spectrum 
market, a commonly used quadratic utility is adopted to quantify the spectrum demand of the 
secondary service, and each primary user aims to maximize the individual profit. In jH, Jia and 
Zhang study the competitions and dynamics of spectrum allocation in a duopoly market via a non- 
cooperative two-stage game. However, authors in [3] do not consider an important feature that a 
primary user usually has limited capacity to lease. Authors in mainly focus on the situation 
that both two wireless service providers have limited spectrum capacities in the price competition 
stage. In fact, the price competition in a more general oligopoly game is rather difficult to be 
analyzed when the constraints of spectrum capacity are incorporated. 

In this paper, we investigate the competitive pricing of a general oligopoly spectrum game. 
Distinguished from previous work, our study concentrates on the capacity-constrained pricing that 
is quite common to the primary users. To characterize the limitation in leasing spectrum resource, 
we employ two approaches: the strict constraints (type-I) and the QoS penalty functions (type-II). 
In the market model with type-I constraints, each primary user has a certain spectrum bound 
so that it might not be able to provide the best spectrum demand. We address the following 
challenging issues: a) is there a unique Nash Equilibrium (NE) in such a capacity-constrained 
spectrum game? b) if YES, how to find the NE efficiently? c) especially in duopoly games, 
what are the impacts of capacity constrains on the NEs and the system dynamics? In the type- 
II market model, the capacity constraint of a primary user is absorbed in the utility as a barrier 
penalty function. This corresponds to the scenario that the leased spectrum is transferred from the 
existing services of primary users. They offer spectrum for monetary return, however, at the cost of 
QoS decrease of primary services. Here, we model the QoS of a primary service as a function of the 
queueing delay. Generally, explicit solution does not exist in such an oligopoly spectrum allocation 
game. We present a novel method to discover the NE and to prove its uniqueness. Consider the 
fact that a primary user usually has no knowledge of the utilities and the price-demand functions of 
its opponents, we develop a set of price adjustment algorithms based on the best response dynamics 
and the bounded rationality principles. 

To summarize, our contributions are: 

• We formulate two oligopoly market models to characterize the capacity limitations: the strict 
constraints (type-I) and the QoS based penalty functions (type-II). Given the above market 
models, the primary users compete for revenue or utility maximization by deciding the prices 

2 



of per-unit spectrum. 

• In the type-I model, we propose a novel searching method to find a Nash Equilibrium and 
prove its uniqueness. Interestingly, we find the revenue gaps in the duopoly Stackelberg game 
with type-I constraints. 

• We present two algorithms, StrictBEST and StrictBR, to adjust prices dynamically based 
on the best response dynamics and the bounded rationality when the market information is 



• In the type-II model, we present an interesting variable transformation method to derive the 
Nash Equilibrium and prove its uniqueness. The QoSBEST algorithm is proposed to perform 
spectrum pricing based on the best response dynamics. 

• We demonstrate the nonlinear dynamic behaviors in the StrictBR algorithm when the learn- 
ing rates vary. 

The rest of this paper is organized as follows. In section [21 we present the system models of 
capacity-constrained spectrum market. In section [3] and HI we analyze the NEs of the noncoop- 
erative oligopoly market with type-I and type-II constrains respectively. Section [5] evaluates the 
analysis and performance of proposed schemes. We present an overview of related work in section 
[6] and conclude in section [71 

2. System Model 

In this section, we present mathematical models to characterize the dynamic spectrum alloca- 
tion in cognitive radio networks. To capture the realistic spectrum market, two types of capacity 
constraints are incorporated. 

2.1. Agile Spectrum Market 

We consider the cognitive radio network where multiple primary users (PUs) or wireless service 
providers (WSPs) compete for a shared pool of secondary users. The secondary users are the 
static/mobile devices equipped with cognitive radio technologies. The primary users are the in- 
frastructure based wireless operators or the licensed spectrum holders. They are usually treated as 
the spectrum brokers that lease the unused frequency to the secondary users for monetary payoff. 
We show the structure of a spectrum market in FiglH with a number of primary users and the 
common secondary users. In this spectrum market, the demands of secondary users depend on the 
prices of per-unit spectrum. Each primary user chooses its own price to compete for the secondary 
users' subscription. 

2.2. Secondary Users 

We characterize the spectrum demands of secondary users in the oligopoly market. Define 
Af = {1, 2, • • • , N} as the set of primary users. Let pi be the price of the i*^ primary user. Denote 
Qi to be the quantity of spectrum that secondary users buy from the i*'^ primary user. We define 
the utility of an average secondary user as a quadratic and concave function [H [2] : 



limited. 




i=i j^i 
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Figure 1: Spectrum Market Structure 



where Oj, Pi are positive constants for all i € M. Here, denotes the spectral efficiency of 
wireless transmission by a secondary user using the frequency owned by the i^^ primary user [3]. 
The spectral efficiencies of primary users can be the same, or different, depending on their center 
frequencies. For instance, if the center frequency of the i^^ primary user is high, secondary users 
may experience large path loss (or low spectral efficiency equivalently) when purchasing spectrum 
from this primary user. Similar to previous work, we also take the spectrum substitutability into 
account via the parameter /i. If is 0, a secondary user cannot switch among the primary users. 
When < /X < a secondary user can switch among the primary users depending on the spectral 
efficiency and the price of per-unit spectrum. For example, if one primary user increases its price 
of per-unit of spectrum, some of the secondary users may buy spectrum from other primary users, 
and vice versa. When ai = a and f3i = ix for all i G AA, the spectra of primary users are perfectly 
substitutive for secondary users. Taking the first-order derivative of the utility function with respect 
to qi and letting it be 0, we obtain the purchase price of secondary users from the i^^ primary user: 



Pi 



Oli 



Pi' 



(2) 



The concave function in Eqn.([T]) characterizes user satisfaction in terms of the purchased spectrum. 
In order to guarantee the concavity of utility function, its Jacobian matrix should be negative 
definite, that is. 



dujq) 
5q 



Pi n ... /i 
/i P2 ■■■ fJ- 



Pn 



<0, 



(3) 



where T denotes the above matrix. Note that the concavity of utility function is equivalent to 
the positive definiteness of T. Here, we present a necessary condition in Lemma [T] to set market 
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parameters. 



Lemma 1. The matrix T is positive definite if the market parameters has Pi > fj, > for all 

Proof: Please refer to the Appendix. ■ 
For the primary users, their strategies are to set prices of per-unit spectrum in the oligopoly 
market. Thus, the demand function can be expressed as the following: 



(4) 



where p is the price vector and q is the spectrum demand vector. Given the price vector, the 
spectrum demand of primary user i is rewritten by: 
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Qi = fiip) = ai- biPi + ^ CijPj, G Af, 



(5) 



where bi as well variables computed through Eqn.dl]) and = Oibi — ^ 



pecially, Cij = cji and c 



j^iCijOj. Es- 



-bi due to the symmetry of convert matrix. The market parameters 



transformed from Eqn.dH) satisfy the following property, 

Lemma 2. The parameters that characterize demand-price function in Eqn.Q, i.e. bi and Cij 
{i 7^ j), are positive, given the conditions Pi > fi > for all i,j G Af. 

Proof: Please refer to the Appendix. ■ 
In the market model, if the spectral efficiencies (i.e. Vi G M) are close to each other, one can 
guarantee the positivity of Oj by choosing Pi and fi appropriately. We give an example of spectrum 
market with only two primary users. The utility function results in linear inverse demand functions 



Pi 

P2 



ai 
a2 



Pm 
P2q2 



M2, 

Ml- 



The relationship between prices and demands can be represented by an alternative form: 

Qi = flip) = ai - bipi + cp2, 
Q2 = f2{p) = a2-b2P2 + cpi. 



(6) 
(7) 



(8) 
(9) 



According to Eqn. 
and c = n/{PiP2 - 



H), Oj and bi are calculated by: ai = {aiPj — aj ^) / {P1P2— ^i^) bi = Pj /{P1P2— n"^) 
fi"^) for i 7^ j, G {1,2}). To ensure the positivity of prices and demands, 
there have P1P2 > and OiPj > aj/i for i,j = 1 or 2. 

2.3. Primary Users and Bertrand Game Model 

The spectra leased by primary users are either unused or transferred from existing applications. 
The revenue of the i*^ primary user is a product of the leased spectrum qi and the price pi. In 
practical cognitive radio networks, the primary users cannot always satisfy the demands of the 
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spectrum market. Hence, capacity constraints should be taken into consideration when the primary 
users set the prices. In this paper, we formulate the oligopoly price competition as a Bertrand game. 
The players are the primary users, and the strategy of a primary user is the price of unit spectrum. 
Denote vrj to be the payoff (or utility) of the primary user i that could be cither the revenue or the 
profit. Let Wi be the spectrum size of the primary user and Bi be the traffic loads of primary 
services. The spectrum efficiency, r^ , is defined as the transmission rate per-unit spectrum for the 

i*^ primary service. The available spectrum for sale at player i is expressed as qf = Wi — Bi/r^\ 
We incorporate the capacity constraints into the primary users' payoff via two approaches: 

• Type-I Strict Capacity Constraints: The primary users aim to maximize their monetary 
revenues under the constraints of capacities. The local optimization model of the i^^ primary 
user is expressed as: 

maximize 7ri(pi, p_i) = pi ■ min{/i(p), qf}, (10) 
subject to Pi > 0, yi e M, (11) 

where p_j is the price vector excluding pi. 

• Type-II QoS Penalty Functions: An alternative approach is to translate the capacity con- 
straints as the barrier penalty functions to the revenues. Assume that the traffic pattern of a 
primary user is an i.d.d. poisson arrival process. The average queuing delay of a packet can 
be approximated by ^p^^ , which is adopted to reflect the quality of primary services. 
Therefore, the utility maximization of a primary user can be expressed as: 

Bi/r'-^^ 

maximize Tri{pi, p_i) = pi mm{fi{p),qf} - 9 log . _ -'r]/ x (12) 

% mm-j^jjl^p;, (/^ I 

subject to Pi > 0, G jV, (13) 

where the positive variable 6 is the weight of the lognormal M/M/1 queuing delay. 

Note that the capacity constraints in type-II have different implication from that in type- 
I. There might have multiple equilibrium points in the type-II model. Therefore, the infeasible 
solutions are excluded if they are outside of the capacity bounds. In the Bertrand game, the Nash 
Equilibrium is a vector of spectrum prices that no player can increase its payoff by changing its 
price unilaterally. 



3. Noncooperative Game with Type-I Capacity Constraints 

In this section, we first present the static game and the leader-follower game with Type-I 
capacity constraints, by assuming the availability of full market information of primary users. 
Furthermore, a dynamic game is formulated to characterize the interactions of price competition 
when such information is not available. 

3.1. Static Duopoly Game 

We commence the analysis by considering a duopoly spectrum market with two primary users, 
and then extend to a more general scenario. In the duopoly game, the NE price of a player is 
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obtained by assuming that tlie otlier player also chooses the best strategy. However, the ^^best" 
strategies of primary users are different in situations whether the spectrum capacities are sufficient 
or not. Since there are two primary users, the competitive pricing can be subdivided into four 
cases. Very recently, authors have analyzed the static games of these four cases and proved the 
existence of NE in Their analysis is based on graphical interpretation, which might be difficult 
to extend to a more general oligopoly game. Inspired by their work, we adopt a slightly different 
way to study the existence of NE in this section. In comparison with the only difference is 
the simplicity of analysis in this subsection. Later on, our method will be extended to prove the 
existence of unique NE in a spectrum game with more than two primary users. 
For the i*^ primary user, it decides the price pi so as to maximize its revenue 0] 

max Pi ■ mm{qf, ai - bipi + cpj], Vi, j G {1, 2}. (14) 

We first investigate the Nash Equilibrium when the available spectra are sufficient for both primary 
users (PUs). The revenues of PUl and PU2 can be written as: 

Mp) = -bliPl - -^^) + , (15) 

r \ ut a2 + cpi 2 , {a2 + cpif 
Mp) = -h2{P2 - + • (16) 

The best responses of PUl and PU2 are: 

^ cpl + ai ^ cp\ + a2 , . 

p' = ^br^ "'^^bT- ^'^^ 

Thus, the duopoly prices at the unique NE are: 

2ai62 + a2C ^ _ 20261 + aic 
bib2 - c2 ' ^2-7^ 

The spectrum demands at the unique NE can be expressed by: 



4oi 62 — 4oi 02 — 



^ 2aibib2 + bia2C ^ 2026162 + 62010 
'^^= 46162 -c2 ' ^^2= 46162 -c2 • ^''^ 



The optimal response of the unconstrained game corresponds to the Case 1 that qf and q'f satisfy 
(7^ > qI and (72 > Q2- Three other cases are also considered when the available spectra are not 
sufficient for the market demands. 

Case 2: > q^ and is sufficiently large. According to Eqn.p4|). the best revenue of PUl is 

2fei 



obtained at the point pi = ""^tb^^ when the capacity is large enough 



.1 > (20) 

Since we assume that q1 is less than the best spectrum demand, the revenue of PUl is expressed 
as TTi = piql if the following inequality holds: 

PI < "^"f (21) 
61 
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Here, we can easily find that this price bound is greater than the best response when PUl has 
a sufficiently large capacity. We next analyze the selfish pricing behavior of PUl. Because PUl 
can lease at most qf units of spectrum, it is inclined to increase pi for better monetary payoff 
until the spectrum demand is exactly equal to the capacity. For PU2, the optimal response is still 
characterized by Eqn. (jl7p so that it can benefit from the increase of pi. Hence, the prices at the 
Nash Equilibrium can be solved by 



t _ ai-qf + cp\ t _ «2 + cp[ 

Pi- , , and P2 - 2, ■ {.^^) 



The results are give by 



^1 " 26162 -c2 ' ^^-^^ 



The spectrum demand of PU2 is: 



t _ 62(0261 + aic - cqf) 
^2 - 26162 -c2 ' ^^^^ 

while that of PUl is exactly the capacity. Note that in Case 2, the capacity of PU2 must have 
qI < 92- 

Case 3: q2 > (?2 and qf is sufficiently large. The leased spectrum of PU2 reaches so that there 
exists 

Pi = °^'f + '^' . (26) 

Following the same method in Case 2, the NE prices of primary users are given by 

i _ 0162 + a2C - cq!l 
^1 " 26162 -c2 ' ^^^> 



t _ 20261 + QIC - 26igf 

^2 - 26162 -C2 ■ ^^^> 



The NE demand of PUl is expressed as: 

I _ bi{aib2 + a2C - cq^) 
- 26162 -c2 • 

Here, the capacity qf should be greater than q|. 

Case 4: There are two possible capacity sets in this case, qf > qi,q2 > ^2 9i — QijQ2 — 12- 
When both PUs cannot provide the best spectrum demands, they are disposed to increase the 
prices until the spectrum demands equal to the capacities. Because q^ and §2 are purchased by the 
secondary users, the prices of per-unit spectrum can be obtained based on the following equations: 

ql = ai- hip\ + cp\, 

ql = a2-b2pl + cp\. (30) 



One can easily obtain the root of above equations: 



Pi 



P2 



0162 + a2C - b2qf 



hh - c 
0261 + aic - h\q2 - cqf 



1«2 



(31) 



3.2. Static Oligopoly Game 

We study the existence of Nash Equilibrium in a more compUcated spectrum market. Consider 
a set of primary users M = {1,2, •••A^} where each of them has a capacity hmit. Two key 
challenges hinder us from finding the existence of NE. First, we do not know which primary users 
have insufficient capacities. Since the secondary users have preference towards the primary users, 
a smaller capacity does not necessarily mean the spectrum limitation compared with a larger one. 
Second, the interaction of prices is still not well studied in the Bertrand oligopoly market with 
capacity constraints. 

To carry out our study, we recap some findings in the duopoly spectrum market. A primary user 
is capacity-insufficient if the capacity is less than the best demand with unlimited spectrum. 
The capacity-insufficient primary user intends to increase the price until the capacity equals to the 
market demand. However, we might not be able to find the capacity-insufficient PUs once for all. 
When capacity-insufficient PUs increases their prices of per-unit spectrum, secondary users may 
go to other PUs, potentially leading to the lack of capacity in those PUs. Therefore, we need to 
search several time recursively to find the capacity-insufficient PUs. Inspired by the above findings, 
we can obtain the NE via the following steps. First, we compute the best reactions of all PUs 
without considering the capacity constraints. In this step, the i^^ PU decides the price by 



Pi 



+ ^jy^i CjjPj 

2bi 



(32) 



Denote to be the number of capacity-insufficient PUs in the k*^ search. We can find Mi 
primary users whose best spectrum demands exceed the capacities in the first search. Let us take 
the capacities of Mi PUs into consideration. The capacity-insufficient PUs have the incentive to 
increase their prices so as to lower down the spectrum demands: 



Pi 



ai - qt + Ej^i CijPj 



(33) 



The remaining PUs increase the prices correspondingly. The best reactions of primary users are 
solved through the equations in Eqn.(|32p and Eqn. (|33| ). Define a new matrix with the parameter 
Ml as 



Q(Mi 
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-C21 

— CMil 
-CMi+1,1 

-CNl 



-C12 
62 



— CMi2 
-CMi+1,2 

-CN2 



bMi 



2bM 



+1 



-ClN 
-C2N 

— CMiN 
-CMi+l,N 

2bN 



and a vector 



a(Mi) = [ai- a2-q2 ■■■ - Qm^ OAfi+i • • • 



Before identifying the capacity-insufficient PUs in the next step, we need to know whether Q(Afi) 
is invertible or not. 

Lemma 3. The matrix Q(Affc) is positive definite if /?j > /z > for alH G in the utihty function. 



Proof: Please refer to the Appendix. ■ 
Provided that Mi primary users are capacity- insufficient, the best responses can be solved via 

p= [Q(Mi)]"'-a(Mi). (34) 

For these AIi PUs, their capacities and the best demands have the following inequalities 

ai + ^djPj > 2q^. (35) 

The spectrum demands of primary user can be obtained using Eqn.dS]). In the new solution vector, 
we might observe that some additional primary users cannot lease the best spectrum demands. As 
a result, they attempt to raise the prices for better revenues. Assume that M2 players have limited 
capacities now, we replace the original Q(Mi) and a(Mi) by 



Q(M2) 



bi 

-C21 



-C12 
62 



-CM2I —CM22 

-CM2+1,1 -CM2+1,2 



-CNl 



-CN2 



&M2 



2b 



M2+1 



and 



a(M2) = [ai -q1 a2 - q^ 



0M2 



-ClN 
-C2N 

— CM2N 
-CM2+l,N 

2hN 



aN\ 



Here, an important question is whether the iterative search method can find more and more 
capacity-insufficient primary users? We must show that the search method will not leads to a 
deadlock. Before proving the nondecreasing property of search results, we introduce a crucial 
definition first. 

Definition 1. Stieltjes matrix [18]: A Stieltjes matrix is a real symmetric positive definite 
matrix with nonpositive off-diagonal entries. Every Stieltjes matrix is invertible to a nonsingular 
symmetric matrix with nonnegative entries. 

According to Lemma [T] and O one can easily find that Q(Mfc) is a Stieltjes matrix. In the following 
Lemma, we will show the nondecreasing property of M^. 
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Lemma 4. The set of capacity-insufficient primary users in the k — 1 step is a subset of that in 
the k^^ step. 

Proof: Please refer to the Appendix. ■ 

Using this method, we can find the primary users with capacity shortage iteratively. The 
proposed method has low computational complexity that requires at most N searching steps. 
Next, we will show that the oligopoly price vector computed above is a Nash Equilibrium. 

Theorem 1. The sets {M^} at subsequent steps of the search algorithm form a nondecreasing 
sequence. The limit of which is the set {M} such that the price vector computed by 

p* = [Q(M)]-i.a(M). (36) 

is a Nash Equilibrium of type-I oligopoly spectrum market. 

Proof: Assume that the primary users in the set A4 = {1, 2, • • • M} have insufficient capacities in 
respect to the best reactions. Their prices of per-unit spectrum are are determined by Eqn. ([55|) . 
The remaining primary users in the set Af\A4 adjust prices according to Eqn. ()32|) . First, we show 
that any player i € M \ M. has no incentive to adjust its price pi. Define a new price = p* ± A 
where A is a positive deviation from p* for i M\A4. The difference of the revenues between p* 
and is expressed as: 

= Piiai - biPi + ^ CijPj) - p*{ai - hip* + ^ djp*) 

= {Pi ± A)(ai - bi{p* ± A) + ^ djp*) - p*{ai - hp* + ^ ajp*) 

= -biA^ + (±A)(a, - 2hp* + Yl c^J^i) = -^i^' < 0- 

Thus, the i^^ primary user obtains smaller revenue if it deviates from the NE price. 
Next, we analyze the pricing strategies of the capacity-insufficient PUs. In the iterative scheme, 
one principle is that the best demand without capacity constraint is greater than qf for any i ^ M 
(i.e., in Eqn. (f35]) ). Consider the price Pi = p* — A, the revenue of the i*'' PU is: 

MPi,P~i) = Qi X {Pi - A) < 7ri{Pi,p*_i) 

because the spectrum demand q[ exceeds the capacity qf. If the i*^ PU chooses a price p'i = pI + A, 
the resulting revenue is: 

T^iiPi, P-i) = (Pi + - biPi -biA + Y djPj) 

= -^iiplpU) + Mat - 2bip* + djPj) - biA^ 
= 7r,(p*, plj + A{2qf - (a^ + ^ djPj)) - ^A^ 

< T^tiP*i,PU) - < ^^iP*i,P-^)■ 
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Since no player has the incentive to adjust its price, the price vector computed by Eqn. ([5H|) is a 
Nash Equihbrium of the ohgopoly spectrum game. I 
Another key question is that whether the NE found by our method is unique or not. We have 
the fohowing theorem for this question: 

Theorem 2. Consider a type-I ohgopoly spectrum market in Eqn.dS]), there exists a unique Nash 
Equilibrium. 

Proof: Please refer to the Appendix. ■ 

3.3. Leader- Follower Duopoly Game 

Up to this point we have considered basically static spectrum games in cognitive radio networks. 
However, the primary users may play different roles in the price competition. We now extend the 
noncooperative game to the leader-follower framework in which one primary user moves first and 
then the other moves sequentially. Consider a duopoly market, we show that the pricing strategies 
of primary users rely on their leader /follower roles and capacities. Similar to the static duopoly 
games, we also consider four cases in the leader-follower games. (The notations P*,pI,pI and p^ 
are reused in this subsection with superscripts ' and to stand for the roles of PUs.) 
Case 1: We consider the scenario that the PUs have enough spectra for the secondary users. Let 
PUl decide the price first and the PU2 decide afterwards. They aim to maximize their individual 
revenue. We use backward induction to find the subgame perfect NE according to the best response 
of PU2, ql*{qi), for every possible value of qi. Then, given that PUl knows PU2's best response, 
we obtain the best response of PUl. Substitute q2 by q2*{qi) in the revenue function of PUl, there 
has 

T^iiPiiPtiPi)) = Pi ■ (oi - ^iPi + ^ll-t^). (37) 



PUl maximizes its revenue at the point: 

_ 2aib2+ca2 , . 

^1 - 2(26i62-c2V ^^^> 

Then, the optimal price of PU2 is expressed as: 

pi* = ^'^^T^ofr 'ir"^^ - (39) 



4026162 - a2C^ + 2aib2C 



The spectrum demands of the primary users at the leader-follower NE are: 

= (40) 

_ 4026162 - 02620^ + 2ai6|c 

~ 462(26162 -C2) • 

Note that the capacities qf and should be greater than q[* and gg* respectively. 

Similarly, using the backward induction method, we can also obtain the price setting when PU2 
is the leading service provider. Comparing the leader-follower game with the static game, both the 
leader and the follower achieve higher prices as well as revenues. 

12 



Case 2: q| is sufficiently large, q'f > if PUl is the leader and q(* > q^ if PUl is the follower. 

The leader-follower NE depends on not only which primary user decides the price ffi'st, but 
also whether the leader is capacity-insufficient or not. As is mentioned in the static game, the 
best response of PUl is to set the price as pi = max{ °'^2bT^ ' To better understand the 

leader-follower interaction, we make use of Figl2]to illustrate the strategies of primary users. Point 
A represents the NE in the static game of Case 2. Because the best response of PUl is greater 
than its capacity, it is inclined to increase pi for better monetary payoff until is reached. In 
point A, the spectrum demand of PUl is exactly qf and the price is decided by pi = £L_^J:£E2._ 




Figure 2: Interactions between pi and p2 

Let PUl decide the price first and PU2 follow. As a leader, PUl knows that the follower's best 
response is p^^ = °'^2bT^ • Then, in the first stage, the revenue of PUl is computed by: 

TTi = min{qi, ai - bipi + cp2} ■ Pi (42) 

■ r o , c(a2 + cpi). 

= mm{g^,ai -bipi+ ' }-pi. (43) 

One can easily find that the best revenue is obtained at the point A if the leader has insufficient 
capacity. This is to say, the NE in the static game is also the NE in the leader-follower pricing 
when the capacity-insufficient PU acts as the leader. Here, the capacity constraints must satisfy 
qf < g^* and q^f > Q2* where (q'i*,(?2*) the unconstrained leader-follower market. 

Next, we will show that the leader-follower spectrum game exhibits a quite different strategy 
when PU2 is the leader. The PU2 has the complete information of the PUl's best response in the 
second stage. Because PUl falls short of spectrum, it will set a higher price such that the spectrum 
demand equals to the capacity: = £^i__^J:££2 ^ Then, the revenue of PU2 is: 

, , c(ai - q? + CP2). 

TT2 = P2 ■ {a2 - b2P2 + T )' (4^) 

The best price of PU2 is thus given by: 

it _ aic + a2h-cqf 
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Substitute p2 by P2 in the best response of p{\p2), we have the following expression of p(^: 

/t _ 2016162 - Qic^ - 2qfbib2 + c^qf + a2bic 
~ 261(6162 -c2) ■ 

This leader-follower A'^i? is illustrated at point B in Figl2j Comparing the NE equilibria A and B, 
we can see that the NE prices depend on the decision sequence of the primary users. When the 
primary user with sufficient spectrum is the leader, both of them have higher equilibrium prices. 
The purchased spectrum from the PU2 is given by 



^ — — — — - (47) 



02^1^2 - 02610^ + 0161620 - aic ^ - 6162C + c^Qi 
261(6162 -c2^ 



Likewise, the capacity of PU2 must have q2 > q2 ■ 
Case 3: qf is sufficiently large, qi^ > PU2 is the leader and q2* > PU2 is the follower. 

Following the method in Case 2, we can obtain the Nash Equilibria in the leader-follower games. 
We omit the solution process and summarize the results as below. 



PUl is the leader: 



PU2 is the leader: 



It _ Q2C + 0162 -cgf ^ 
2(6162 - C^j 

ft 2026162 - 020^ - 2gf 6162 + c^g^ + ai62C 
= 262(6162 -c2) • 



n _ 0162 + 02c -cgf ^ 

^1 " 26162 - C2 ' ^^"^ 



It _ 20261 + oic - 26igf 
^2 - 26162 -c2 ■ ^^^^ 



Case 4: The remaining capacity conditions exclude the those in the other three cases. We adopt 
the backward induction to find the leader-follower NEs. Let PUl be the leader and PU2 be the 
follower. Give the price pi, the best response of player 2 is = ''^^'^^^ if the capacity ^3 is less 
than ^2±Si. Provided that PUl knows PU2's best response, we can obtain the best price of PUl, 
p^i{P2^), so as to obtain the NE for this game. If PUl is also capacity- limited, the price Pi^(p2^) is 

set to -lb,+a2C-b,,f-cg-, ^^^^ 

price of PU2 is solved subsequently. The leader-follower game 
has the same NE as the static game. Furthermore, no matter which primary user is the leader, 
the NEs are the same when both of them are capacity-insufficient. 

3.4- Dynamic Duopoly Game with Best Response Dynamics 

The best responses of the spectrum game are obtained under the assumption that the primary 
users have a global knowledge of the demand functions and the capacity constraints. However, the 
primary users might only be able to observe the limited market information in practice. A primary 
user may update its price in the next round in response to the current prices of the opponents. In 
this subsection, we investigate how primary users interact with each other based on their individual 
best response functions. 
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In the duopoly Bertrand spectrum game, we assume that each primary user merely knows its 
demand function, capacity and the price of the opponent. Note that a player has no knowledge of 
the capacity of its opponent. The spectrum price of the i^^ primary user at time t is denoted as 
Pi{t), and that of the next slot is Pi{t + 1). Here, the "slot" defines the length of time that primary 
users adjust their prices. It can be one hour or one day, which is very flexible. Hence, in each 
slot, player i updates the price of per-unit spectrum according to the best response function in the 
static game: 



Pi{t + l) 



cpj{t)+ai 
2bi 



if £PlM±^ < qa i = 1,2, 



> qf for i = 1,2. 



(52) 



We then study the stability by assuming the updating rules are predetermined for the primary 
users. 

Case 1: Qi < qf and q2 < Q2- The update rule can be written in the matrix form: 



P(i + 1) 







c 
2bi 

" 



262 



• v{t) + 



ai 
2bi 

262 



(53) 



Denote Ai and A2 to be the eigenvalues of the transfer matrix in Eqn.(|53|). One can easily find 
that Ai and A2 are within the unit circle provided the feasibility constraint of the spectrum game: 
6162 > (? ■ According to the Routh-Hurvitz condition, the price adaptation scheme is stable. 
Case 2: q^ > and ^2 < Q2- ^^^^ updating rule in the matrix form is expressed as: 



Pit + 1) 



61 

262 ^ 



p(t) + 



61 

a2_ 

262 



(54) 



Under the condition 6162 > c^, the eigenvalues of the transfer matrix are also in the unit circle. 
Thus, the updating rule is stable. The price updating schemes are stable in Case 3 and Case 4. 
We omit the analysis since they are similar to Case 1 and Case 2. 

However, without the information of q*, player i might not know how to select the price 
adaptation rule. We introduce a simple algorithm named "StrictBEST" to update the prices 
without knowing the demand function of the opponent. In each iteration, the i*'' primary user 
determines the price by 



Pi{t + 1) = max{ 



cpj{t) + aj Qj - + cpj{t) 



}• 



(55) 



2hi h 

The above scheme has been used for the price adjustment of the two-stage game in We present 
the detailed proofs under different capacity constraints. However, the above proof is incomplete 
because the primary users might switch the price adjustment rules from time to time. This imposes 
great difficulty to prove the convergence of the rule in Eqn. (j55| ). Here, we present a conjecture on 
the StrictBEST algorithm. 

Conjecture 1. The StrictBEST algorithm converges to the unique Nash Equilibrium if the market 
parameters are positive as well as 61 > c and 62 > c. 

We propose a "potential" method to prove the above conjecture. This method needs to consider 
many cases, which can not be exhausted in this paper. An key observation is that the price of 
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a primary user is a function of its price two slot before. Thus, the StrictBEST algorithm might 
be able to converge if the prices of primary users get closer and closer to the NE every 2 slots. 
As we know, there are four types of capacity constraints (i.e. Case l'^4 this subsection). At 
time t, PUl and PU2 have four different strategy profiles, and at time t + 1, they also have four 
types of adjustment strategies, resulting a total number of 64 scenarios within two slots. Due to 
the complexity of this "potential" method, we do not prove the convergence property case by case. 
Here, we abuse to denote the equilibrium price of i^^ PU to be p*. The guidelines of the "potential" 
proof are summarized as below, 

• Prove that there exists only one equilibrium in Eqn. (j55p when the spectrum capacities qf 
and q| are given. 

• The distance between the price of user i and its equilibrium price is becoming smaller and 
smaller every two slots, that is, \pi{t + 2) — p*\ < \pi{t) — p*\- 

3.5. Dynamic Duopoly Game with Bounded Rationality 

In a practical spectrum market, a primary user may not be able to observe the profit gained 
by other primary services. Except the adjustment rule based on best response function, a primary 
user can also choose price for secondary users by learning the behaviors of other players from 
the history. Bounded rationality mimics the human behavior that players do not make perfectly 
rational decisions due to the limited information and their conservativeness. The notion of bounded 
rationality, also denoted as gradient dynamics, is employed in dynamic Cournot oligopoly models 
U) l3, 13]. Authors in [2] adopts bounded rational strategy to adjust spectrum price for the first 



time. The bounded rational rule is equivalent to a distributed algorithm that gradually approaches 
the equilibrium price. For instance, if a primary user is assumed to be a bounded rational player, 
though ignorant of the actual demand, it updates the price of per-unit spectrum based on a local 
estimate of the marginal profit [2]. In this subsection, we adopt the similar strategy to perform 
dynamic pricing in duopoly spectrum market. The major difference lies in that we concentrate on 
the interactions of duopoly primary users with capacity constraints. When the capacity is large 
enough, the i^^ PU adjusts its price according to the following rule: 

dn (t^ 

Pi(t + l)=pi{t)+jiPi{t)--^^, i = l,2, (56) 

where ji is the learning stepsize. The learning rate Pi{t) ■ captures the conservativeness of 

players that may not totally believe the observed market information. If player i cannot provide 
the optimal spectrum demand, it updates the price pi = °' ^^^^ in each period. 

In a self-mapping system, the most important issue is the stability property. We first analyze 
local stability of the dynamic spectrum sharing in Case 1. At the equilibrium point of price 
adaptation, there has p{t -|- 1) = p{t) = p*. Since Pi{t) is a self-mapping function, the fixed points 
can be obtained by solving the following equations: 

7iPi(ai - 2bipi + cp2) = 0, (57) 
72P2(a2 - 262P2 + cpi) = 0. (58) 

In this duopoly spectrum game with bounded rationality, the self-mapping dynamic system has 
four fixed points p*^^^ , p*^^^ , p*^'^^ and p*^^^ as follows: 
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.(2) 



p(^) = [0, 



262^' 

2aib2 + a2C 2a2bi + aic-^ 
46162 



C2 ' 46162 



and p(^) = [0,0]. Clearly, one can see the fixed point p^^^ without zero price is the Nash Equilib- 



rium. 



Here, we apply the "Routh-Hurvitz" condition to analyze the stability of these fixed points. At 
the equilibrium p(^), the Jacobian matrix of the self-mapping system is expressed as: 



J(p 



Jll(p(^)) Jl2(p(^)) 
J2l(pW) J22(p(')) 



1 + 7iai 
1 + 7202 



(59) 



The self-mapping system is stable only when the eigenvalues of the Jacobian matrix are in the unit 
circle. Because 7^ and are nonnegative, the eigenvalues are greater than 1. Hence, the fixed 
point p(i) is unstable in the self-mapping model. For the fixed point p^^), the Jacobian matrix is 



J(p 



{■2)- 



Jll(p(^)) Jl2(p(^)) 
J2l(p(')) J22(P^')) 



1 - 7iai 

l + 72(a2 + if)J 



71 "ic 
2bi 



(60) 



which means that p^^-* is not a stable equilibrium. Similarly, the fixed point p*-^) is not stable 
either. For the fixed point p(^) , the Jacobian matrix is 



J(p 



(4)1 



Jll(p(^)) Jl2(p(^)) 
J2l(p(^)) J22(p(^)) 



_ 2yibi(2aib2+a2c) 
72c(2a2fei+aic) 



",i(.:(2ai/j2+a2c) 
Abxb^-c^ 
272 b2 (2a2 bi+aic) 

46162—? 



The characteristic function of the Jacobian matrix is given by: 

A' - A(Jii(p(^)) + J22(p(^))) - Jl2(p(^)) • J2l(p(^)) = 0. 

The eigenvalues Ai and A2 are 



Al,2 



(Jll + J22) ± ^{Ju + J22f - 4Jl2 J2I 



(61) 



(62) 



(63) 



The fixed point p(^) is stable only when |Ai| and IA2I are within the unit circle. Given the spectrum 
market parameters and the learning rates 71,2, one can easily check the stability of the Nash 
Equilibrium. 

Next, we analyze the dynamics of price adaptation with bounded rationality in Case 2. Since 
PUl cannot supply the best spectrum demand of the secondary users, its price update follows the 
rulepi = £iz^J:£Ei. By letting p(t-|-l) = p(t), we solve the fixed points of the self-mapping system: 
p(i) = i] and p(2) = [?^-g^|H£_^, "^SX-Zl - Likewise, we apply the "Routh-Hurvitz" 

condition to the resulting Jacobian matrices. For the fixed point p(^), the Jacobian matrix is given 

by 



J(p 







c 

61 



1-1-72(02-1- 
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(64) 



Note that ai is greater than the available spectrum qf. Thus, the fixed point p^^^ is unstable. In 
the equilibrium point p^'^^ of Case 2, the Jacobian matrix is written as 

^ 1 

(65) 

2bi62-c2 ^ 26162-c^ 

Following Eqn. (j63p . we can obtain the eigenvalues Ai and A2, and validate the stability of the 
self-mapping system. The stability analyses of Case 3 and Case 4 are omitted because the Case 
3 is very similar to the Case 2 and the Case 4 adopts exactly the best response strategy. 

In the dynamic spectrum game, a primary user has no information of demand functions and 
capacities of its opponent. Thus, it is necessary to compare the prices upon the situations whether 
the capacity is sufficient or not. We present a distribute scheme, namely "StrictBR", for the price 
update with strict capacity constraints: 

Pi{t+1)= max{^i^^^-i^^, p,{t) + J^pi{t){ai - 2hiPi{t) + cpj{t))], (66) 

for i,j = 1 or 2. When the price happens to be in the iteration, the primary users need choose 
a small positive price randomly to leave the zero equilibrium point. 



J(p(^)) = 



4. Noncooperative Game with Type-II Capacity Constraints 

In this section, we analyze static and dynamic spectrum games with Type-II capacity con- 
straints. An iterative strategy is proposed to set prices by using local market information. 

^.1. Static Duopoly Game 

We analyze the competitive pricing of duopoly primary users who aim to maximize their utili- 
ties. The utility (or profit) is composed of two parts, the revenue and the delay-based cost. In the 
feasible region, the utility of the i^^ primary user is expressed as 

7^i{pi,P-i) =Pi{ai -biPi + cpj) -eiog— '—^ , Vi,j G {1,2}. 

qt -a^ + biPi - cpj 

To find the best response of the primary users, we differentiate the utility vTj with respect to pi and 
let the derivative be 0, there have 

Stti , 9b^ , , 

— = ai - 26ipi + cp2 + -i = 0, 67 

Pi q^-ai+ bipi - cp2 

dTTo , Obo , s 

— = 02 - 262P2 + cpi + = 0. 68 

P2 q%-a2^ b2P2 - cpi 



To reduce the complexity of expression in Eqn. (l67j) and (l68]) . we represent the prices pi using the 
spectrum demands qi in Eqn.(l2]) for i = 1,2. Thus, the above equations are transformed into 
follows: 

9b\ 

qi - bi{ai - I3iqi - iiq2) + — = 0, (69) 

Qi - Qi 

6b 

q2 - b2{a2 - P2q2 - mi) + = 0> (70) 
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where qi is within the feasible region S = {qi\0 <: Qi < qf,i = 1,2}. In the above nonhnear 
equations, the exphcit forms of qi and q2 cannot be solved directly. Submit qi in Eqn. (i69]) to 
Eqn. ([70|) . we have 

b,ai - {1 + Pibi)qi - -i^ 

7 ■ (1 + P2b2) + f^b2qi 

= 0202 ■ — — — eiT— • (71) 



^2 fj.bi 

We next show that there is only one solution in the range (0, qf). The left-hand expression can be 
further rewritten as: 

biai - gi - -1^ 1 

, ' ■ (1 + P2b2) --(/?!+ (3lp2b2 - /x'62). 

One can see that the left-hand expression is a strictly decreasing function of qi in the feasible 
region. The right-hand expression of Eqn. (j7ip is a strictly increasing function of qi in the range 
{0,qf). When qi approaches qf, the right-hand expression is approximated by 6202, while the 
left-hand expression is negatively infinite. Hence, there exist a unique feasible solution in Eqn. ()7ip 
only if the left-hand is greater than the right-hand at the point qi = 0. We can find the range of 
9 to guarantee the unique feasible solution for qi. Here, we only show that there exists a unique 
qi G (0, qf) in this duopoly market when 6 is small. In the point qi = 0, the difference between the 
left-hand and the right-hand is approximated by 

oi ^ ai(32b2 - a2&2/^ ^ ^ 

since 01^2 > 012^1 holds in the duopoly model. The above analysis also implies that there exists a 
unique Nash Equilibrium when 9 is sufficiently small. 

4-2. Static Oligopoly Game 

In this subsection, we extend the above analysis to a more general oligopoly spectrum game 
with type-II constraints. By introducing a novel variable transformation method, we analytically 
show the existence of unique NE. Given the utility of the i*'' primary user, we take the first-order 
derivative over pf. 



diTi , sr^ Obi ,„ , 

— = ai- 2biPi + c Vpi + -T-^ = 0. 72 

Pi ^ - Oi + biPi - c . pj 



Unlike the duopoly spectrum game, we cannot directly find the equations to solve pi for more 
than two primary users. To simplify the analysis, we substitute the variables pi by qi according to 
Eqn.([2|) for i G J\f. Formally, there exists 

qi - bi{ai - (3iqi - fj.y^^Pj) + ^ ' = 0, Vi € A/". (73) 

<l^ li 
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We denote a new variable Z = X^jg^ qi to be the total spectrum provision in the oligopoly market. 
Thus, the above equations can be transformed into: 

{l + Pibi-nbi)qi + fibiZ-aibi + ^^ = 0, Vie A/". (74) 

Assume that Z is a constant, the original coupled equations are converted into a set of separated 
quadratic formations: 

(1 + Pibi - nbi)qf - ((1 + Pibi - nbi)q'^ + aA - jibiZ)qi 
-{nbiZqt-aibiqt + Obi) = 0, ^i^N. (75) 

For the i*^ PU, qi has two roots: 

(1) ^ ((1 + Pibj - fibi)qf + ajbi - nbjZ) 

" 2{l +Pibi-iJibi) 

y((l + Pibi - nbi)qf + aA - iibiZy + 4(1 + PA - nhXfihZqf - aAqf + Obi) 



2(1 + PA - lih 

((1 + PA - l^bi)qf + aA - IJ^biZ) 





2(1 + PA - f^k) 






-(1 + PA - fibi)qf + aA - l^biZy + 4(1 + pA 


- fibi)Obi 


qp = 


2(1 + PA - lihi) 
((1 + PA - ^hi)qf + aA - l^hZ) 




2(1 + PA - fibi) 




Vi- 


-(1 + PA - fibi)q^ + aA - ^ib.Zf + 4(1 + PA 


- iibi)6bi 


2(1 + PA - fibi) 



(76) 



(77) 

We need to validate if the roots are within the feasible ranges. Let q^^ and q^^ minus g", we have 

(1) _ a _ (-(1 + PA - lJ'bi)q1 + aA - li-biZ) 
Qi Qi ~ 



and 



2(1 + PA - Ilk) 
V(-(l + PA - fibi)q^ + aA - f-^bjZy + 4(1 + pA - fibijObl 
2(1 + PA - tibi) 



(2) _ a ^ (-(1 + PA - l^bi)qf + aA - l^bjZ) 

" 2{1+ PA -fibi) 

y(-(l + PA - fibi)q^ + aA - IJibiZ)'^ + 4(1 + pA - fibi)ebi 



> 0, (78) 



< 0, (79) 



2(1 + PA - tibi) 

>e qf^ i 

by a function of Z. Define a set of functions hi{Z) = qf''' for i E M, we will show that hi{Z) is a 



(2) 

because Pi is greater than fi. When q^ is chosen to be for each primary user i, it is represented 
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decreasing function. Differentiate hi{Z) over Z, we have 
dZ 



-fibj^jl + PA - fibi)qf + aA - fibjZy + 4(1 + PA - fxb^)9bi 
2(1 + PA - fib^){^{-{l + PA - f^bi)qf + aA - ^ihZf + 4(1 + PA - iih)eh) 

^J'bi{-{l + PA - fJ'bi)qf + aA - fJ-bjZ) 

2(1 + PA - + (^ibi - f^bi)qf + aA - ^ihZf + 4(1 + PA - ^^h)9bi) 

< 0. (80) 

Sum up all the spectrum demands, we obtain the following self-mapping equation: 

N 

Z = Y,h{Z). (81) 

i=l 

The left-hand of the above one-dimensional function is strictly increasing, while the right-hand is 
a bounded and decreasing function. To guarantee the existence of unique solution, the right-hand 
should be greater than the left-hand when Z is 0: 

((1 + PA - l^bi)qf + aA) 
.^^ 2(1 + PA - f^h) 

N 



^ V(-(l + PA - f^bi)qf + aAY + 4(1 + PA - t^bi)0b. 



The above inequality has a much simpler necessary condition. By separating the inequality (I82p 
into N smaller inequalities for primary users, we obtain the necessary conditions aiqf > 9 for 
i G Af. The solution Z* can be solved numerically via binary search or golden search methods. 
Subsequently, the individual spectrum demands can be obtained through Eqn. (j77p . Since the 
demand qi is nonnegative, Z* and the market parameters must have ^biZqf — aAlf + < 0. 
Formally, we have the following theorem on the existence of unique NE: 

Theorem 3. Consider a type-II oligopoly spectrum market in Eqn. (ll2p with primary users. 
There exists a unique Nash Equilibrium if the following conditions hold: 

• Pi > fi for all i G M; 

• ai, Pi, fj,, Ui, bi > for all i G M, and Cij > for i,j G A/", i ^ j; 

• aiqf > 6* for i G A/"; 

• Given the unique solution Z* to Eqn. ifST]) . there has fibiZqf — aibiqf + Obi < for i G M. 

4-3. Dynamic Duopoly Game with Best Response Dynamics 

In the dynamic spectrum game, the revenue and the QoS of a primary user are not available 
to its opponent. Hence, the decision of prices is made based on the local utility function and the 
observed prices. We adopt the best response scheme to adjust the prices of per-unit spectrum. 
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Each player assumes that it is the only service provider in the spectrum market. According to 
Eqn. (f67|) and (f68ll . the price pi is a function of pj (Vz 7^ j) in each time period 

2bjpj + (2&i(g^ - ai - CP2) - bi{ai + cpa))^! 

-(ai+cp2)K-ai-ciJ2)- 61^ = 0, (83) 
26ipi + (262(9! - 02 - cpi) - 62(02 + cpi))p2 

- (as + cpi ) (g^ - a2 - cpi ) - 62^ = 0. (84) 

The prices pi and p2 can be easily solved by treating the opponent's price as a constant. Denote 
P^i\p2) and p^i\p2) to be the roots of Eqn. (f83ll . Let p'^\pi) and p'^\pi) be the roots of Eqn. lfSll) . 
A subsequent problem is how to select the "appropriate" prices for both players. According to the 
Eqn.Q and ([9]), player i computes the spectrum demands qi{pl^\pj)) and qi{pf'\pj)) for i ^ j- If 
one of the demands is in the range (0, qf"), the corresponding price is selected to lease the spectrum. 
The preceding analysis also shows that under certain market condition, there is only one feasible 
price for each primary user. Both of the demands might also be outside/inside of the range {0,qf) 
for some special reasons such as random noise etc. Under this situation, the price is chosen to 
receive better utility. We propose a distributed algorithm named "QoSBEST" to determine the 
prices, which can also be extended to a more general oligopoly market. The "QoSBEST" algorithm 
is specified in FiglSl 



1: Calculate the prices of per-unit spectrum in Eqn. ^83]) and 

2: )(i + 1) = a, - 5ipf ^(t + 1) + cpj{t), for i ^ j and k = 1,2; 

//predict the current demands 
3: If ^ < qf and ^ > qf 
4: p.(t)=pf^ 
5: elseif q^^^ > qf and qf^ < qf 

6: p,{t)^p^^; 
7: end 



Figure 3: QoSBEST Algorithm 

An important question is that whether the QoSBEST algorithm converges to the Nash Equi- 
librium. We first give a lemma that will be used in the proof and then present the complete proof 
of convergence. 

Lemma 5. For any real positive constants x, y and z, there have 

— (x + y) < \/x^+z^ — \f\p^^-^ < X -\-y. (85) 

and 

\l ^ z^ — y^y2 _|_ ^2 < x — y, H X > y. (86) 



Proof: Omitted due to its simplicity. 
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Theorem 4. The QoSBEST algorithm converges to the unique NE if the market parameters are 
positive as well as 61 > c > and 62 > c > 0. 

Proof: Please refer to the Appendix. ■ 
With the type-II constraints, there are multiple fixed points in the bounded rationality model. 
The marginal profit based iterative scheme may possibly converge to a fixed point outside of the 
feasible region. Hence, we avoid to consider the dynamic game with bounded rationality in this 
section. 



5. Performance Evaluation 

We present the numerical results to evaluate the price competition and the performance of 
adjustment strategies in the duopoly spectrum market. Distinguished from some previous work, 
several important dynamic behaviors are investigated such as bifurcation diagrams, strange attrac- 
tors and Lyapunov exponents. 

5.1. Static and Leader- Follower Games with Type-I Constraints 

We first study the NEs of static and leader-follower duopoly games under different settings. To 
perform the numerical analysis, we configure the parameters for the Bertrand demand functions: 

qi = 30MHz - 2pi + I.bp2\ 
q2 = mMHz -4p2 + l.5pi. 

where the unit of pi is dollar /MHz and that of Qi is MHz for i = 1,2. In this cognitive radio 
environment, the primary users have capacity constraints when leasing the unused spectra. Ac- 
cording to the best responses in the Type-I model, one can easily compute the optimal prices and 
spectrum demands without capacity constraints: p^ = 9.58,^2 = 5.55, = 19.16 and ^2 = 22.18. 
In the first experiment, we assume that the capacity of PU2 is large enough, while that of PUl is 
limited. When PUl's capacity increases from 4MHz to 24MHz, the prices and the revenues at the 
static NEs are shown in FiglHandEl With the increase of PUl's capacity, both of them tend to 
reduce the prices to compete for secondary users. Although the price of PUl descends, its revenue 
increases on the contrary due to the increased capacity. In term of PU2, its price and spectrum 
demand decrease until the corresponding values in Case I are met. Next, we analyze the prices 
and the revenues at the NEs when the capacity of PU2 is constrained by 15MHz and that of PUl 
increases from 4MHz to 24MHz. FiglB] shows that the prices of PUl and PU2 decrease when qf 
grows. In FiglTl the revenue of PU2 becomes smaller and smaller because the price p\ decreases 
while the spectrum demand is constrained by its capacity. 

We also illustrate the NEs of the leader-follower games. Let us first consider the case that 
both primary users have unlimited capacities. When PUl is the leader and PU2 is the follower, 
the prices at the NE are p* = 10.36 and P2 = 5.69. The corresponding spectrum demands are 
g* = 17.81 and ^2 = 22.77. When PU2 is the leader and PUl is the follower, the prices and 
the spectrum demands at the NE are obtained: p* = 9.75, P2 = 6, ql = 19.50 and ^2 = 20.63. 
Next, we study the NEs in the case that PU2 has sufficient spectrum and PUl's capacity increases 
from 4MHz to 24MHz. The prices and revenues are compared in FiglH] and [9] depending on which 
primary user is the leader. When the capacity of PUl is less than 18MHz, the primary users have 
better prices and revenues if PU2 plays the role of market leader. One can easily draw a conclusion 
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Figure 4: The change of prices when qf increases from 4MHz to 24MHz and §2 is sufficiently large 
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Figure 5: The change of revenues when increases from 4MHz to 24MHz and is sufficiently 

large 



that the primary user with sufficient capacity, instead of the capacity-insufficient one, is profitable 
to be the leader in the duopoly Bertrand game. When we further increase the PUl's capacity, the 
NEs become those in Case 1. 



5.2. Static Games with Type-II Constraints 

In this subsection, we simulate the competitive pricing of the static spectrum game with type-II 
constraints. Specifically, we evaluate the price of per-unit spectrum and utilities by varying the 
capacity constraints and the QoS coefficient 6. Note that the utility of primary users with type-II 
constraints contains 6log{Bi/k^^'') for i = 1, 2. Since they are constants, we only compare the parts 
in the utility functions that are related to the prices. 
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Figure 6: The change of prices when qf increases from 4MHz to 24MHz and ^2 is 15MHz 




Figure 7: The change of revenues when qf increases from 4MHz to 24MHz and ^2 is 15MHz 



In the first set of experiments, the coefficient 9 is set to 0.1, and the capacities of primary 
users increase from 4MHz to 24MHz. The prices of PUl at the NEs are shown in FigllOl The 
numerical experiments manifest that the prices of per-unit spectrum decreases with the increase 
of the primary users' capacities. Figjlll demonstrates the utihties of PUl, in which the utiUty is 
an increasing function with respect to the capacity of PU2. One can see that the utihties grow 
when PUl and PU2 increase their capacities. In the second set of experiments, we aim to explore 
the relationship between 9 and the price competition. Let and ^2 be 15MIIz. When 9 increases 
from 0.001 to 20, the prices of the primary users are shown in Fig|12l When 9 becomes larger, the 
primary users are inclined to increase the prices to reduce the utility loss caused by the penalty 
functions. 
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Figure 8: Prices verses PUl's Capacity in the Leader- Follower Game for Sufficiently Large 




Figure 9: Revenues verses PUl's Capacity in the Leader-Follower Game for Sufficiently Large (?2 



5.3. Dynamic Game 

We examine the dynamic behaviors of the noncooperative games with Type-I and Type-II 
constraints. Especially, the convergence rates of the proposed algorithms: StrictBEST, StrictBR 
and QoSBEST are evaluated. In the type-I model, we evaluate two settings that correspond to Case 
1 and Case 2 respectively: {q^ = WOMHz,q^ = lOOMHz} and {qf = 10MHz,q^ = WOMHz}. 
The QoS coefficient 9 is set to 0.1 in the QoSBEST scheme. The price adaptations of StrictBEST 
and QoSBEST are shown in Fig ll3lfT^ One can see that both StrictBEST and QoSBEST quickly 
converge to their individual equilibrium points. The dynamic adjustment of the StrictBR scheme 
is shown in Fig llStfTEl where the learning rates 71^2 are both set to 0.01 and 0.03. The convergence 
rate of StrictBR depends on the learning rates 71 and 72. By cross-comparing Figll5l and Fig ll6l 
we observe that the convergence rate of Case 2 is faster than that of Case 1. This is because the 
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Figure 10: The price of PUl in type-II model when both primary users increase spectrum 

capacities 
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Figure 11: The revenue of PUl in type-II model when both primary users increase spectrum 

capacities 



adjustment strategy of PUl does not have a conservative learning procedure in Case 2. In the 
StrictBR scheme, small learning rates can guarantee stability of the self-mapping system, however, 
at the cost of slow convergence speeds. 



5.4- Nonlinear Instability with Bounded Rationality 

We explore the nonlinear dynamics such as bifurcation and chaos in the type-I duopoly game 
with bounded rationality. These complex behaviors are important because they reveal how prices 
of primary users evolve over time and how initial conditions influence the results of spectrum 
allocation. As is shown above, the StrictBR scheme can be applied in Case 1,2,3. Thus, we only 
consider Case 1 and Case 2 in the numerical studies since Case 3 is similar to Case 2. 
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Figure 12: The price of PUs in type-II model when the parameter 9 varies 

Figl 171 shows the bifurcation diagram of Case 1 with respect to the learning rate 71. Here, the 
capacities of PUl and PU2 are both lOOMHz. The learning rate 72 is fixed to be 0.01 and the 
learning rate 71 increases from 0.01 to 0.09. The bifurcation diagram manifests that the attractor 
of Case 1 model is multivalued in term of parameter 71. One can also see in Fig[T7]that there exists 
a stable NE when 71 is less than 0.0511. As 71 further increases, the NE become unstable and 
infinitely periodic doubling that leads to chaos eventually. The bifurcation diagram of Case 2 with 
respect to 72 is illustrated in FigllSI where the capacity bounds are qf =10MHz and =100MHz. 
The learning rate 71 is 0.01 and the learning rate 72 grows from 0.01 to 0.06. When 72 is less than 
0.0331, the StrictBR scheme converges to the unique NE of Case 2 duopoly model. 

We show the graphs of strange attractors for Case 1 with the parameter constellation (71,72) = 
(0.07,0.02) in FigfTOl and for Case 2 with the parameter constellation (71,72) = (0.01,0.06) in 



Figl20[ Especially, Figl20l exhibits a fractal structure similar to Henon attractor l4 ]. 

The Lyapunov exponent of a dynamical system characterizes the rate of separation of infinites- 
imally close trajectories. To analyze the parameter settings in which aperiodic behaviors occur, we 
compute the maximal Lyapunov exponents for the learning rates. If the maximal Lyapunov expo- 
nent is positive, the duopoly game with bounded rationality is chaotic. For Case 1, the maximal 
Lyapunov exponent is shown in Figl21l as a function of the learning rate 71. When 71 is 0.0511, 
the maximal Lyapunov exponent becomes positive, which causes the first periodic doubling bi- 
furcation in Fig|17l When 71 is greater than 0.0671, the maximal Lyapunov exponent is greater 
than 0. This indicates that the self-mapping price adaptation is a chaotic system. In Figl221 we 
display the maximal Lyapunov exponent of Case 2 with respect to the learning rate 72. Here, the 
learning rate 71 is set to 0.01. When the learning rate 72 is around 0.0331, the duopoly game in 
Case 2 meets the first doubling bifurcation. With the increase of 72, the dynamic price adaptation 
becomes chaotic. 



6. Related Work 

The rapid development of wireless communication systems in the past two decades have resulted 
in the great needs of a finite and scarce resource: wireless spectrum. On the other hand, existing 
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Figure 14: QoSBEST: Iteration of Prices 



wireless devices operate in the fixed frequency bands, which can be very inefficient in terms of 
spectrum utilization. The research carried out by FCC shows that temporal and geographical 
variations in the utihzation of the assigned spectrum range from 15% to 85% As a promising 
technology, dynamic spectrum access is brought forward in the design of next generation wireless 
communication systems. The under-utilized spectrum bands can be detected and exploited by the 
users equipped with cognitive radios. For the detailed information, interested readers can refer to 
recent surveys in [B] and @]. 

One key feature of dynamic spectrum access is how the primary users (or wireless service 
provides) and the secondary users (or end users) share the spectrum efficiently and fairly. In 
particular, the dynamic spectrum sharing may involve selling and purchasing processes. Thus, it 
is natural to study the interactions of network components for dynamic spectrum sharing from 
the perspective of economics. The existing work can be mainly grouped into two classes: auction- 
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Figure 16: StrictBR: Iteration of Prices: {qf = WMHz, q^ = lOOMHz) 

based Baa and price-based [3. Eol. 13] . Authors in the seminal work [7] target at the dishonest 
bidding issues in an eBay-like dynamic spectrum market. A truthful and computationally efficient 
auction mechanism is presented toperform dynamic spectrum allocation. To maximize revenue and 
spectrum utilization, authors in [20] propose a real-time spectrum auction framework to distribute 
spectrum among a large number wireless users under interference constraints. Zhu and Liu 0] 
propose an auction-based collusion-resistant dynamic spectrum allocation approach to combat 
user collusion in cognitive wireless networks. An economic framework is also presented in 
to model the spectrum allocation to wireless service providers (WSPs) and the interaction of of 
end users with the WSPs, but the competition among WSPs is not the focus. Leveraging on 



microeconomics inspired mechanisms, authors in jl9[] develop both bargaining and auction based 
mechanisms to find the most optimized allocation pattern for a given area and allocation duration. 
Some other auction based spectrum sharing mechanisms can be found in [j^ . [2^ . [25I . [2^ . In 
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Figure 18: Bifurcation Diagram: qf = lOMHz and = lOOMHz 



the price-based class, Niyato and Hossain [3] introduce the oligopoly pricing theory to characterize 
the interactions between the primary users and the secondary users. In the oligopoly spectrum 
market, a commonly used quadratic utility is adopted to quantify the spectrum demand of the 
secondary service, and each primary user aims to maximize the individual profit. In another work 
[lo| . they consider the dynamic spectrum sharing among a primary user and multiple secondary 
users. They formulate the problem as an oligopoly market competition and use a noncooperative 
game to obtain the Nash Equilibrium. Very recently in an important work [^], Jia and Zhang 
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Figure 19: A Strange Attractor for the FoHowing Parameters: Initial pi = 5, Initial p2 = 5, 
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Figure 20: A Strange Attractor for the Following Parameters: Initial pi = 5, Initial p2 = 5, 
71 = 0.01, 72 = 0.06, qf = WMHz and q^ = WOMHz 



formulate the price and the spectrum competitions as a two-stage non-cooperative game that is 
inspired by the theoretic analyses of Cournot and Bertrand games yjj, iJ/T,] . In [21j , authors consider 
a primary user employing CDMA at the physical layer who aims to lease its spectrum within 
a certain geographic subregion. [2^ studies a revenue maximization problem in a Stackelberg 
game, where spectrum owner, primary users and secondary users are the players for opportunistic 
spectrum access. Besides, authors in 27| build a game theoretic model to investigate whether light 
regulation in the form of etiquette protocols, device design and bargaining amongst users can avoid 
the tragedy of common in unlicensed spectrum. In terms of nonlinear dynamics in the economics, 
authors in [l5(] have shown that the bounded rationality can cause chaotic behaviors in a Cournot 
duopoly. 
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Figure 22: Lyapunov Exponent: = lOMHz and ^2 = WOMHz 
7. Conclusion 

This paper suggests an economic framework for dynamic spectrum allocation in the emerging 
cognitive radio networks. The primary users serve as the spectrum brokers that lease the excessive 
frequency to the secondary users for monetary payoff. We present oligopoly Bertrand market 
models to characterize the capacity-limited spectrum sharing with two types of constraints: the 
strict constraints (type-I) and the QoS penalty functions (type-II). In the type-I oligopoly market, 
we present a low-complexity scheme to search the NE and prove its uniqueness. Especially, when 
the number of primary users reduces to two, we demonstrate the interesting revenue gaps in 
the leader-follower game. Two iterative algorithm, StrictBEST and StrictBR, are presented to 
adjust the prices when the primary users only possesses the local market information. In the 
type-II model, we prove the existence of unique NE and propose a price updating algorithm 
named QoSBEST. Numerical examples validate our analysis and manifest the effectiveness of our 
proposals. In particular, we experimentally show the representative nonlinear dynamics in the 
StrictBR algorithm such as bifurcations, chaotic maps as well as Lyapunov exponents. Our future 
research will be placed on the competitive pricing in more complicated markets, e.g. the number 
of active primary users are not deterministic. 
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APPENDIX 



Lemma [1} The matrix T is positive definite if the market parameters has /3j > /i > for all 

ieAf. 

Proof: Given a x 1 nonzero, real vector x, there has, 



x'^Tx = x'^ 



(3i /i 



1 1 ... 1 
1 1 ... 1 



1 1 
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/3i -/u 
132 -^^ 
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(87) 



i=l 



2=1 



Hence, the matrix T is positive definite if /3j > /i > for all i € Af. 



Lemma [2} T/ie parameters that characterize demand-price function in Eqn.^, i.e. hi and Cij 

{i 7^ j), are positive, given the conditions Pi > fi > for all i,j E M. 

Proof: We use Cramer's rule to compute the invertible matrix of T as follows. 
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where |T| is the determinant of T and Ujj is the matrix cofactor. Then, the variable bi can be 
expressed as bi = The variables Cij can be written as Cij = when i ^ j. Since T is positive 
definite, |T| is greater than 0. The cofactor JJa is 



1 

ItI 



Pi 



P 



i-l 



Pi+l 



Pn 



where all the elements except diagonal ones in the above determinant are ^. According to Lemma 
[U one can easily find 6i > for all i € N . Similarly, we can also prove that Ujj is negative for all 
i,j € M and i ^ j. Thus, the market parameters Cij are all positive for i,j £ M and i ^ j. The 
only difference lies in that we need to exchange certain columns in the determinant before applying 
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Lemma [TJ 



Lemma[3t The matrix Q(Mfc) is positive definite if Pi > fi > for alii Af in the utility function. 
Proof: We prove this lemma by contradiction. 
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We assume that the matrix Q(Mk) is singular. Thus, there exists a non-zero vector x that 
has x-^(5(Affc)x = 0. Because the matrix T is positive definite, it inverse matrix is also positive 
definite. We then rewrite x'^(5(-^fe)x 



N 



x^Q{Mk)x = x'^T-^x + bixj > x'^T^^x > 0. 



for any non-zero vector x. 



Lemma [4l The set of capacity-insufficient primary users in the k — step is a subset of that in 
the k^^ step. 

Proof: Denote and {Mk} to be the number and the set of capacity-insufficient primary users 
in the k^^ search respectively. Denote p-'^^ to be the price of primary user i in the k^^ search. At 
the beginning, Mq is equal to 0. 

Li the first search, there must have Mi > 0. Otherwise, all the primary users have sufficient 
capacities. Without loss of generality, we look at the the /c*^ search result. 

The k^^ search is based on a priori knowledge that the primary users in the set {Mfc_i} are 
capacity-insufficient. We assume that the k^^ search finds out capacity-insufficient primary 
users. Hence, the newly found primary users satisfy 



(89) 



Because the price strategy of the i^^ new capacity-insufficient PU is — ^ , the above 



(k) 

equality is equivalent to bip) > qf. 
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Next, we compare the price vector of the k^^ and the k — 1^^ searches. According to Eqn. 
alternative form of price difference is expressed as 



an 



Q(Mfe_i)-(pW-p 
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Submit the conditions h^qf'^ > qf for all i = Mfc_i + 1, • • • , to the above equation, we obtain 



Q(Mfc_i).(pW-p^''-^^)>0. 



(fc-l)N 



(91) 



Because Q(Mfc_i) is a Stieltjes matrix, it is inverse nonnegative. Therefore, all elements in the 
vector p^'^) — p^'^"^) are nonnegative. This means that the prices of all primary users do not de- 
crease in each search. For the primary users in the set Mk-i, their optimal spectrum demands 

, (fe) 

are — Ml_!£.J_^ which is also nondecreasing. To sum up, when a primary user is found to be 

capacity-insufficient in the k — 1^^ round, it still lacks of capacity in the next search. ■ 

Theorem [2j Consider a type-I oligopoly spectrum market in Eqn. there exists a unique Nash 
Equilibrium. 

Proof: We prove this theorem via two steps by contradiction. First, we will show that there exists 
a unique NE if the capacity-insufficient PUs are unchangeable. In the second step, we prove that 
the set of PUs that have insufficient capacities is unique in the oligopoly market. 

As is mentioned earlier, a selfish PU decides the prices according to the rule Eqn. ([32]) if the 
capacity is less than the best demand, and the rule Eqn.(|33p otherwise. Provided a market with 
N PUs, we can find that M of them are capacity-insufficient through the proposed search method. 
The price vector at the NE, p* , can be computed by 

p* = [Q(Af)]-i.a(M), 

where Q is a positive-definite matrix. Hence, when the set of capacity-insufficient PUs are deter- 
mined, there is a unique Nash Equilibrium. 

Next, we demonstrate that there has a unique set of capacity- insufficient primary users. The 
primary users in the set M are grouped into four mutually exclusive classes: A/i, A/2, A/3 and M^. 
PUs in the sets Ai and A'2 are capacity-insufficient in the iterative search (i.e. Ai = A/i U A'2), but 
the PUs in the sets A/a and have enough capacities. The price vector at the NE is denoted as 
P* = {phPh ' ' ' ^P*n}- PVs must have 
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Assume that there exists a different set of PUs that are also capacity-insufficient. For general 
purpose, we express this new set as A/i U A/3 and the capacity-sufficient set as N2 U M^,. Note 
that the above PU sets can be empty, but has a union of TV. Since there has a different set of 
capacity-insufficient PUs, we can find another NE price vector pT that also have 

>(?f ^ hp\>q'i ViGAAi; (96) 

>CLi ^ bip\>qt yieMs; (97) 

<qt bipl<qt ViGAAs; (98) 

<qt hp\<q'i yieAf^. (99) 





+ EjVi 






2 






+ EjVi 






2 




m 


+ EjVi 






2 










2 



Here, one can easily find there have > p* for i G A/a and > pj for i G A2. According to the 
market model, the prices at the NEs can be expressed in terms of spectrum demands q* and qj. 
Then, p] and p| in the sets J\f2 and M4 are written by. 



* 

Pi 


= a.i 


-(A 


- m)9? 








-A-E 


* 


- E 


G AA2; 


(100) 
















jeA/'a 
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Pi 


= a.i 


-(A 


- m)9* 




-"E 


«° 


-"E 


^1 


- /i E '^i' 


Vi G AA3; 


(101) 


























Pi 


= OLi 


-(A 


- m)?! 


-^E'^° 


E 




E ' 




E '^l' 


Vi G AA2; 


(102) 


























pI 


= Oti 


-(A 


- M)gf 




-"E 


^] 


-/^ E ' 


a 
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Vi G AA3. 


(103) 



























Recall that the conditions in Eqn. (f93]) . (fMl) . ([971) and ([98]) present the results: p\ > p* for i G A/s 
and p* > p\ for i G M2- Hence, we have the following inequality 

P* + p\ > pI + P*i Vi G A/'a, / G AAj. (104) 
Submit Eqn. (jl00p - (jl03p to Eqn. (jl04p and cancel out the common items, we obtain the inequality 

(ft - M)gJ + (A - > {Pi - f^)qt + (A - f^)qf g A/^a, / g aAj. (io5) 

Obviously, the above inequality is not true provided the market conditions Pi > for i G M. Thus, 
there is a unique set of capacity-insufficient primary users, resulting in a unique Nash Equilibrium 
in the oligopoly spectrum game. ■ 

Theorem |3t The QoSBEST algorithm converges to the unique NE if the market parameters are 
positive as well as bi > c > and 62 > c > 0. 

Proof: Assume the NE prices of PUl and PU2 are pi and P2- We start from time t and solve 
the equation in Eqn. ([83|1 . Although pi{t + 1) in Eqn. (i83]) has two roots, one can easily check 
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their feasibility by submitting these roots to the equation qi{t) = ai — bipi{t + 1) + cp2{t). After 
excluding the infeasible root, the unique price of PUl is expressed as 



Pi{t + 1) = , 

and the optimal price of PUl is written as 



, (3ai + 3cp*2 - 2q1) + J{ai + cp*^ - 2qfy + 861 
= ifel ' 

In order to compare pi{t + 1) and p^, we consider four cases with regard to different P2{t) and P2- 
Case 1: oi + cp2(t) - 2qf > and ai + cp^ - 2qf > 0; 
The difference between pi{t + 1) and p^ is, 



,,^1, * HP2{t)-p*2) , V{ai + cp2{t)-2qfy+8he-y/{a^+cp*2-2qfy + 8ht 
pi{t + i) -pi = — h 



46i 4bi 

If P2{t) > there has ai + cp2{t) — 2qf > oi + — > 0. According to Lemma[Sl the following 
inequality holds, 

Mt + i)-K<^^^^^^^f^. (106) 

On the other hand, if p2(*) < P2j there has < ai +cp2{t) — 2qf < ai ■^cp\ — 2qf. Based on Lemma 
O we can obtain the following inequality, 

P.it + l)-pl> <P'^'l-P'^\ (107) 

Combine Eqn. (|106p and (|107p together, we have, 

\pi{t + l)-pl\<^\p2it)-p*2\. (108) 

Case 2: oi + cp2it) - 2qf < and ai + cp2 - 2qf < 0; 

If P2{t) < P2, there has < 2qf — (oi + CP2) < 2qf — (ai + cp2{t)). Similarly, we calculate the 
difference between pi{t + 1) and p^, 

< p,it +l)-pl< 'M^^. (109) 
Otherwise, \i p2{t) > P2, the difference between pi{t + 1) and p* satisfies, 

^^^^^ < Pi(^ + 1) - K < (110) 

The above analysis manifests that the following inequalities hold, 

^Jp2it)-P*2\ < \Pl{t + l)-pl\<^^\p2{t)-p*2\. (Ill) 
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Case 3: ai + cp2{t) - 2qf > and ai + cp^ - 2qf < 0; 

The case 3 also implies P2it) > P2- According to Lemma [5l the following inequalities hold, 

- c{p2{t) - p*2) < («i + cp2{t) - 2q1f + ShiO - ^ {2qf - (ai + cp*^))^ + 8h0 < c{p2{t) - pi). 
Therefore, the difference between pi{t + 1) and p\ satisfies, 

I^^\p2{t)-Pl\ < \pi{t + l)-pl\ < ^^\p2{t)-pl\. (112) 

Case 4: ai + cp2{t) - 2qf < and ai + cp^ - 2q^ > 0; 

This case implies P2{t) < pi- Using the similar method as that in Case 3, we obtain the same 
inequality in pi2|) . Combine all the analytic results together, we can see the distance between 
pi{t + 1) and pI has \pi{t + 1) — p*| < ^\p2{t) — pl\- Using the similar steps, we can easily find 
that in slot {t + 1), the following inequality holds, 

2 2 

\p^{t + 2)-pl\<-^\p,{t + l)-pl\ < ^\p^{t)-pl\ < (^)l|p2(0)-p^|, (113) 

02 O1O2 O1O2 

where ^2(0) is the initial price of PU2. Given the conditions bi > c and 62 > c, the QoSBEST 
algorithm converges to the unique NE. 
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